Trellis encoded vector quantization for robust speech recognition
Authors
Abstract
This paper describes a joint data (feature) and channel (bias) estimation framework for robust speech recognition. A trellis encoded vector quantizer is used as a pre-processor to estimate the channel bias by blind maximum likelihood sequence estimation. The sequential constraint in the feature vector sequence is explored and used in two ways: (a) in the selection of the quantized signal constellation, and (b) in the decoding process for joint data and channel estimation. A two-state trellis encoded vector quantizer is designed for signal bias removal applications. Compared with the conventional memoryless VQ-based approach to signal bias removal, the preliminary experimental results indicate that incorporating the sequential constraint in joint data and channel estimation is advantageous for robust speech recognition.
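The contrast between memoryless and trellis-constrained bias estimation can be illustrated with a minimal sketch. It is not the paper's implementation: the additive-bias model (observed frame = clean codeword + constant bias), the iterative mean-residual update, the toy one-dimensional codebook, and the two-state trellis in `BRANCHES` are all assumptions made for illustration, and every function name is hypothetical.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def memoryless_assign(frames, codebook, bias):
    """Nearest codeword per bias-compensated frame, with no sequence constraint."""
    indices = []
    for y in frames:
        x = [yi - bi for yi, bi in zip(y, bias)]
        indices.append(min(range(len(codebook)), key=lambda k: sq_dist(x, codebook[k])))
    return indices


def trellis_assign(frames, codebook, bias, branches, start_state=0):
    """Viterbi search for the minimum-distortion codeword sequence the trellis allows.

    branches[s] is a list of (next_state, allowed_codeword_indices) pairs.
    """
    inf = float("inf")
    n_states = len(branches)
    cost = [inf] * n_states
    cost[start_state] = 0.0
    paths = [[] for _ in range(n_states)]
    for y in frames:
        x = [yi - bi for yi, bi in zip(y, bias)]
        new_cost = [inf] * n_states
        new_paths = [None] * n_states
        for s in range(n_states):
            if cost[s] == inf:
                continue  # state not reachable yet
            for nxt, subset in branches[s]:
                k = min(subset, key=lambda j: sq_dist(x, codebook[j]))
                c = cost[s] + sq_dist(x, codebook[k])
                if c < new_cost[nxt]:
                    new_cost[nxt] = c
                    new_paths[nxt] = paths[s] + [k]
        cost, paths = new_cost, new_paths
    best = min(range(n_states), key=lambda s: cost[s])
    return paths[best]


def estimate_bias(frames, codebook, assign, n_iter=5):
    """Alternate codeword assignment and bias re-estimation (mean residual)."""
    dim = len(frames[0])
    bias = [0.0] * dim
    for _ in range(n_iter):
        idx = assign(frames, codebook, bias)
        bias = [
            sum(y[d] - codebook[k][d] for y, k in zip(frames, idx)) / len(frames)
            for d in range(dim)
        ]
    return bias


# Toy setup (assumed, not from the paper): four scalar codewords and a
# two-state trellis in which each state permits only two of them.
CODEBOOK = [[0.0], [2.0], [4.0], [6.0]]
BRANCHES = [
    [(0, [0]), (1, [2])],  # from state 0: codeword 0 stays, codeword 2 moves to state 1
    [(0, [1]), (1, [3])],  # from state 1: codeword 1 moves to state 0, codeword 3 stays
]
```

With this toy trellis, the frames `[[0.1], [2.1]]` are decoded as codewords `[0, 1]` by the memoryless quantizer but as `[0, 2]` by the Viterbi search, because codeword 1 is not reachable from the start state after one step. The bias estimate is then computed along whichever codeword sequence the chosen assignment function returns, for example `estimate_bias(frames, CODEBOOK, lambda f, c, b: trellis_assign(f, c, b, BRANCHES))`.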
Similar resources
Low Bit Rate Speech Coding via TCVRQ
We present a new Trellis Coded Vector Residual Quantizer (TCVRQ) that combines trellis coding and vector residual quantization. We introduce new methods for computing quantization levels and experimentally analyze the performance of our TCVRQ for speech coding at very low bit rates. The results obtained show that transparent quantization of Linear Prediction (LP) parameters can be p...
Quantization of LSF parameters using a trellis modeling
An efficient Block-based Trellis Quantization (BTQ) scheme is proposed for the quantization of the Line Spectral Frequencies (LSF) in speech coding applications. The scheme is based on modeling the LSF intraframe dependencies with a trellis structure. The ordering property and the fact that LSF parameters are bounded within a range are explicitly incorporated in the trellis model. BTQ sea...
Block Constrained Trellis Coded Vector Quantization of LSF Parameters for Wideband Speech Codecs
ETRI Journal, Volume 30, Number 5, October 2008. Abstract: In this paper, block constrained trellis coded vector quantization (BC-TCVQ) is presented for quantizing the line spectrum frequency parameters of the wideband speech codec. Both a predictive structure and a safety-net concept are combined into BC-TCVQ to develop the predictive BC-TCVQ. The performance of this quantization is compared wit...
Low-Delay Wideband Speech Coding Using a New Frequency Domain Approach
In this paper, a new frequency domain approach suitable for low-delay wideband speech coding is proposed. Working in the context of residual speech coders, the proposed technique performs a decomposition of the DFT of the target vector (the input speech after the subtraction of the zero input response signal) as the product of the DFT of the impulse response of the LPC synthesis filter and a ...
Improving the performance of MFCC for Persian robust speech recognition
The Mel frequency cepstral coefficients (MFCC) are the most widely used features in speech recognition, but they are very sensitive to noise. In this paper, to achieve satisfactory performance in Automatic Speech Recognition (ASR) applications, we introduce a new noise-robust set of MFCC vectors estimated through the following steps. First, spectral mean normalization is a pre-processing which applies to t...